
- 63 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Levy, Gary 

Clark, David A. 

(ii) TITLE OF INVENTION: Method; 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: BERESKIN & 

(B) STREET: 40 King Street 

(C) CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) ZIP: M5H 3Y3 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy di 

(B) COMPUTER: IBM PC compat 

(C) OPERATING SYSTEM: PC-DDS 

(D) SOFTWARE: Patentln Rel 



(vi) CURRENT APPLICATION DATA 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 15-NOV-l 

(C) CLASSIFICATION: 



(viii) ATTORNEY / AGENT INFORMATION 

(A) NAME: Gravelle, Michel 

(B) REGISTRATION NUMBER: 

(C) REFERENCE /DOCKET NUMBER: 




of Modulating Immune Coagulation 
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ible 
/MS-DOS 
Base #1.0, Version #1.30 



/442,143 
9 



Lne 
,261 

9579-014 



4 ) 



(ix) TELECOMMUNICATION INFORMATI )N 

(A) TELEPHONE: (416) 364-7 ill 

(B) TELEFAX: (416) 361-139B 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4630 base pair^s 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nuclei|c acid 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ lb NO:l: 

GATCTAGGGT TGGAAGCCAG GTCTCCTGAG TAT 3CGAGAA 

TAAAGAGTCT GCCAACATTT TGAGAATGTG AAT W3GATTT 

ACAGAAAAGT CATAGGAAAT CAGGTTAAAG ACA 1 AAATAT 

TTAAGTAATA CAATAAAACA TTTAGATTTT TGC( :CATGTC 

TAAAGCAAAA AAACCCTTTT TAAACAAGAA ATC TATGAG 

TTAAAAGGAG GTGGTTTCTC TAACTGAAGC TGT1 HCTCTT 



TAAATACAGT 
GGCTAAAATT 
GAGATAGGCT 
AGTCATTTTG 
ATGTCAATAT 
TCCTGCCTTC 



CATGGAAGTG 
AAGGGGATAT 
ACAGAGTGTT 
AAATTATTTT 
GCAAAACAAA 
AGCCTCTGAA 



60 
120 
180 
240 
300 
360 
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GAGAAAGTTA GAAAACTATT ATCATTAftTG CTACATGTTT TGAACAAGCT GATATACCAA 420 

GTGGCCCAGA GAGCAGGTAG AAGAACCAGC GTGGAGACAG AAAGCAAGAG GCCCGCCTGC 480 

CAGGGCTACC TGCAGAAAGA AAGGGCAflAG ATGCTGTAGG CAAGAGAAGT TCAGGACAGA 540 

CACTGGCATA GCTCAAAGAT TCACATTTpA GCAGCTGTGG AAGATGACAG TACAATTACC 600 

AAAATGTCGA AGGGCAAAGG AGGCAGCTAC TGGTTTTGAT GAAAGACAAT TATGTCCTTT 660 

TAAATGGGTC TTAGACATTT AGACATTTAT ATACACTATG CTACGGACAA AGGAATAGAA 720 

AGTAGCACTT TTTTCTCCAC TAGTTTTCTff CTCTTTTTCA AGTAGATGAA GCAAAAGTCA 780 

ACTGCAATAG TCAGAAAGCT GTACTTTGTT ACACTTAGAA ACTTCTAAAA GTGCTTAAGA 840 

TTTCACCTGA AAGTCCAACA TGAAGAAAATl ACAGGCTCCC CAATGCCCCA TTC TAAGAAG 900 

GAAAAAGGAC CATTTTCATT TTAGTAACGT \ TTCTGTTCTA TAGACAGTTT GGATAACTAG 960 

CTCTTACTTT TTATCTTTAA AAACTGTTTT ITCCAGTGAAG TTACGTATAA TTATTTACTT 1020 

CAAGCGTAGT ATACCAAATT ACTTTAGAAA TGCAAGACTT TTCTTATACT TCATAAAATA 1080 

CATTATGAAA GTGAATCTTG TTGGCTGTGT ACATTTGACT ATAATAATTT CAATGCATAT 1140 

TATTTCTATT GAGAGTAAGT TACAGTTTTT GGCAAACTGC GTTTGATGAG GGCTATCTCC 1200 

TCTTCCTGTG CGTTTCTAAA ACTTGTGATG CAAACGCTCC CACCCTTTCC TGGGAACACA 1260 

GAAAGCCTGA CTCAGGCCAT GGCCGCTATT AWVGCAGCTC CAGCCCTGCG CACTCCCTGC 1320 

TGGGGTGAGC AGCACTGTAA AGATGAAGCT GGTTAACTGG TACTGGCTGA GCTCAGCTGT 13 80 

TCTTGCCACT TACGGTTTTT TGGTTGTGGC AAMTAATGAA ACAGAGGAAA TTAAAGATGA 1440 

AAGAGCAAAG GATGTCTGCC CAGTGAGACT AGaIaGCAGA GGGAAATGCG AAGAGGCAGG 1500 

GGAGTGCCCC TACCAGGTAA GCCTGCCCCC CTTGACTATT CAGCTCCCGA AGCAATTCAG 1560 

CAGGATCGAG GAGGTGTTCA AAGAAGTCCA AAACCTCAAG GAAATCGTAA ATAGTCTAAA 1620 

GAAATCTTGC CAAGACTGCA AGCTGCAGGC TGATttACAAC GGAGACCCAG GCAGAAACGG 1680 

ACTGTTGTTA CCCAGTACAG GAGCCCCGGG AGAGGTTGGT GATAACAGAG TTAGAGAATT 1740 

AGAGAGTGAG GTTAACAAGC TGTCCTCTGA GCTAAR.GAAT GCCAAAGAGG AGATCAATGT 1800 

ACTTCATGGT CGCCTGGAGA AGCTGAATCT TGTAAATATG AACAACATAG AAAATTATGT 1860 

TGACAGCAAA GTGGCAAATC TAACATTTGT TGTCAMTAGT TTGGATGGCA AATGTTCAAA 1920 

GTGTCCCAGC CAAGAACAAA TACAGTCACG TCCAGGTATG TATAATAATG TTTTCTTATC 1980 

ATATGTTCAT AAATGTTATA CAGTCAGAGA TGTATCnAAA AGATTAACCT GAGTCAGTAA 2040 

GTTAAATAGA TGACAGATTA AGTCTTTTAT TTATCAAGGT GCACAGGAAA AAATAAATAT 2100 

CTTCTCAAAT ATGACCACAT AAATATGACC TAATTACAAA ATCATAGTTA GTTCTGTATC 2160 

CACTGGAAGT CACTTTCAAT TTTAAGATCT TATTTGTTRA TGCCAGACCT ACTTGCAAGC 2220 

AGAGATTAGA GGTCCTTTCT GCTTTATAAC ATTAGGTTOT TCTTGTGAGG CCTTAAGCAT 2280 

TTACTAAACA CCTTCAAGTA AGTTTAGTAA AGTTTCATTA CTGCCATTGA TTCAATTATC 2340 
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AAACTGCTTT 


TGTACATATA 


AAGAATTfcTT 


CAGATGCATG 


GTTTCTATTA 


ACAAGATCCA 


2400 


ATGCCTTCCT 


TTTATTTCCC 


CTTCAGT r DCA 


ACATCTAATA 


TATAAAGATT 


GCTCTGACTA 


2460 


CTACGCAATA 


GGCAAAAGAA 


GCAGTGAGAC 


CTACAGAGTT 


ACACCTGATC 


CCAAAAATAG 


2520 


TAGCTTTGAA 


GTTTACTGTG 


ACATGGAGAC 


CATGGGGGGA 


GGCTGGACAG 


TGCTGCAGGC 


2580 


ACGTCTCGAT 


GGGAGCACCA 


ACTTCACcAj 


AACATGGCAA 


GACTACAAAG 


CAGGCTTTGG 


2640 


AAACCTCAGA 


AGGGAATTTT 


GGCTGGGGAA 


CGATAAAATT 


CATCTTCTGA 


CCAAGAGTAA 


2700 


GGAAATGATT 


CTGAGAATAG 


ATCTTGAAGA\ 


CTTTAATGGT 


GTCGAACTAT 


ATGCCTTGTA 


2760 


TGATCAGTTT 


TATGTGGCTA 


ATGAGTTTCT 


\cAAATATCGT 


TTACACGTTG 


GTAACTATAA 


2820 


TGGCACAGCT 


GGAGATGCAT 


TACGTTTCAA 


IAAACATTAC 


AACCACGATC 


TGAAGTTTTT 


2880 


CACCACTCCA 


GATAAAGACA 


ATGATCGATA 


TipCTTCTGGG 


AACTGTGGGC 


TGTACTACAG 


2940 


TTCAGGCTGG 


TGGTTTGATG 


CATGTCTTTC 


T3CAAACTTA 


AATGGCAAAT 


ATTATCACCA 


3000 


AAAATACAGA 


GGTGTCCGTA 


ATGGGATTTT 


CTpGGGTACC 


TGGCCTGGTG 


TAAGTGAGGr 


3060 


ACACCCTGGT 


GGCTACAAGT 


CCTCCTTCAA 


AGAGGC TAAG 


ATGATGATCA 


GACCCAAGCA 


3120 


CTTTAAGCCA 


TAAATCACTC 


TGTTCATTCC 


TCcbvGGTATT 


CGTTATCTAA 


TAGGGCAATT 


3180 


AATTCCTTGT 


TTCATATTTT 


TCATAGCTAA 


AAAATGATGT 


CTGACGGCTA 


GGTTCTTATG 

x x v— • x x ** x x-j 


3240 


CTACACAGCA 


TTTGAAATAA 


AGCTGAAAAA 


CAAlpCATTT 


TAAAGGAGTC 


TTTTRTTttTT 

X X X VJJ X X VJ X X 


3300 
j j \j \j 


ATGCTGTTAT 


CCAATGAACA 


CTTGCAAGCA 


ATTACCAATA 


TTGAGAATTA 

X X wnwrui X X 


TACATTAGAT 

x x x frvjii x 


3360 


TTACAATTCT 


TTTAATTTCT 


ATTGAAACTT 


TTTCT1ATTGC 


TTGTATTACT 


TGPTGTATTT 

X WV X VJJ x r\ XXX 


3420 


AAAAAATAAT 


TGTTGGCTGG 


GTGTGGTAGC 


TCACGCCTGT 


AATCCCAGCA 


CTTTGGAATG 

x x x vjvjrui x \3 


3480 


TCAAGGCAGG 


CAGATCACTT 


GAGGTCAGGA 


GTTTGaGACC 


AGCCTGGCCA 


AACATGTGAA 


3540 


ACGCTGTCTC 


TATTAAAAAT 


ACAAAAATTA 


GCCGGGCATG 


GTGGTACATG 


CCTGTAATCA 


3600 


ACGCTGTTTA 


TTAAAAATAC 


AAAAATTAGC 


CGGGCAipGT 


GGACATGCCT 


GTAATCCTAG 


3fifi0 


TACTTGGGAG 


GCTGAGGCAG 


GAGAATCGCT 


TGAACCTKAG 


AGGAAGAGGT 


TGCAGTGAGC 




CAAGAATGAG 


CCACTGCACT 


CCAGCATGGG 


tgacagagWa 


AACTCTGTCT 


CAAAPAAAAA 


nan 


AATAATAAAA 


TTTATTCAGT 


AGGTGGATTC 


TACACAAA&T 


AATCTCJTATT 

xl«i\ X V— ' X X X X 


TGGGfTATfiA 




TTTAAGCACA 


TCTGAAGGTA 


TATCACTCTT 


TTCAGGCTAlT 


AATTATTTGG 


GTAATCTTPA 

VJ X iui X ^ X X i^n 


J^UU 


TTCTGAGACA 


AACTTAATCT 


ATATCATTTA 


CTTTGCAACA 


GAACAACCCT 


rtv-rlVJ^- rt, X 1 X 1 








1 AA 1 A 1CTAT 


ATAATTAAACT 


TTGTTCATTT 


ATCATTCATG 


4020 


AAATATAAAA 


TACTTGTCAT 


TTAAACCGTT 


TAAAAATGTG | 


IGTAGCATAAT 


GTCACCCCAA 


4080 


AAAGCATTCA 


GAAAGCAATG 


TAACTGTGAA 


GACCAGGGTT 


ITAAAGGTAAT 


TCATTTATAG 


4140 


TTTATAACTC 


CTTAGATGTT 


TGATGTTGAA 


AACTGCTTTA 


ACATGAAAAT 


TATCTTCCTC 


4200 


TGCTCTGTGT 


GAACAATAGC 


TTTTAATTTA 


AGATTGCTCA 


CTTACTGTACT 


AGACTACTGG 


4260 


TAGGTTTTTT 


TGGGGGGGGG 


TGGGTAGGGA 


TATGTGGGTA 


ATGAAGCATT 


TACTTACAGG 


4320 
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CTATCATACT CTGAGGCCAA TTTTATCiCC AAAGCAATAA TATCATTAAG 
CATAGAAGGC TAAGTTTCTC TAGGACAGAT AGAAAACATG AATTTTGAAA 
AGTAGTTAAA ATACTATATA TTTCAACCuT GGCTGGTAGA TTGCTTATTT 
AACTAAAAGA TAGATTTTTA CCCAAACAGA AGTATCTGTA ATTTTTATAA 
CTGGAATGCT ATATATAATA TTTAAAAGA9 TTTTTAAATG TGTTTAATTT 
AAAAGGGATC 

( 2 ) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 439 amino cjcids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : singl^ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



TGATTCACTT 
TATATAGAAC 
TACTATCAGA 
TTCATCAATT 
CATCATCGTA 



4380 
4440 
4500 
4560 
4620 
4630 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Lys Leu Ala Asn Trp Tyr Ttp Leu Ser Ser Ala Val Leu Ala Thr 
1 5 \ 10 15 

Tyr Gly Phe Leu Val Val Ala AsVi Asn Glu Thr Glu Glu lie Lys Asp 
20 \ 25 30 

Glu Arg Ala Lys Asp Val Cys Prd Val Arg Leu Glu Ser Arg Gly Lys 
35 40 I 45 

Cys Glu Glu Ala Gly Glu Cys Pro 1 Tyr Gin Val Ser Leu Pro Pro Leu 
50 55 I 60 

Thr lie Gin Leu Pro Lys Gin Phe Iser Arg lie Glu Glu Val Phe Lys 
65 70 \ 75 80 

Glu Val Gin Asn Leu Lys Glu lie tal Asn Ser Leu Lys Lys Ser Cys 
85 1 90 95 

Gin Asp Cys Lys Leu Gin Ala Asp aIsp Asn Gly Asp Pro Gly Arg Asn 
100 1P5 110 

Gly Leu Leu Leu Pro Ser Thr Gly Aia Pro Gly Glu Val Gly Asp Asn 
115 120 \ 125 

Arg Val Arg Glu Leu Glu Ser Glu Va\. Asn Lys Leu Ser Ser Glu Leu 
130 135 \ 140 

Lys Asn Ala Lys Glu Glu lie Asn Val\ Leu His Gly Arg Leu Glu Lys 
145 150 \ 155 160 

Leu Asn Leu Val Asn Met Asn Asn lie iGlu Asn Tyr Val Asp Ser Lys 
165 b.70 175 

Val Ala Asn Leu Thr Phe Val Val Asn fcer Leu Asp Gly Lys Cys Ser 
180 185 \ 190 

Lys Cys Pro Ser Gin Glu Gin He Gin sWr Arg Pro Val Gin His Leu 
195 200 \ 205 

He Tyr Lys Asp Cys Ser Asp Tyr Tyr hia He Gly Lys Arg Ser Ser 
210 215 \ 220 
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Glu Thr Tyr Arg Val Thr Pfo Asp Pro Lys Asn Ser Ser Phe Glu Val 
225 230 235 240 

Tyr Cys Asp Met Glu Thr M^t Gly Gly Gly Trp Thr Val Leu Gin Ala 



Arg Leu Asp Gly Ser Thr Ain Phe Thr Arg Thr Trp Gin Asp Tyr Lys 
260 265 270 

Ala Gly Phe Gly Asn Leu A^g Arg Glu Phe Trp Leu Gly Asn Asp Lys 
275 280 285 

lie His Leu Leu Thr Lys Sar Lys Glu Met lie Leu Arg He Asp Leu 
290 295 300 

Glu Asp Phe Asn Gly Val Glju Leu Tyr Ala Leu Tyr Asp Gin Phe Tyr 
305 310 315 320 

Val Ala Asn Glu Phe Leu Lyfe Tyr Arg Leu His Val Gly Asn Tyr Asn 
325 330 335 

Gly Thr Ala Gly Asp Ala Lei Arg Phe Asn Lys His Tyr Asn His Asp 
340 345 350 

Leu Lys Phe Phe Thr Thr Prd Asp Lys Asp Asn Asp Arg Tyr Pro Ser 
355 360 365 

Gly Asn Cys Gly Leu Tyr Tyr Ser Ser Gly Trp Trp Phe Asp Ala Cys 
370 375 380 

Leu Ser Ala Asn Leu Asn Gly I Lys Tyr Tyr His Gin Lys Tyr Arg Gly 
385 390 I 395 400 

Val Arg Asn Gly He Phe Trp Gly Thr Trp Pro Gly Val Ser Glu Ala 
405 1 410 415 

His Pro Gly Gly Tyr Lys Ser Eer Phe Lys Glu Ala Lys Met Met He 
420 1 425 430 

Arg Pro Lys His Phe Lys Pro I 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : I 

(A) LENGTH: 5403 base pairs 

(B) TYPE: nucleic acid T 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 1 

(ii) MOLECULE TYPE: other nucletc acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CATAAGGCGT GTCTGACAAA TTCTTCATAC ACACATTTCC CCTTTGCACA TTCAGTCTGT 60 

ATAGGTTATT TCTATAGGAG AAAAAAAATA TTcWaATTCC TTGTGCACTG GTAACAGGCA 120 

TGAAGGC TCA GCAAAGCCAA TACGTGTTAT GTcAaGTTGG AGACAGTGCC AGGGCCAACA 180 

TTCCAGACTT CTCAGATAGA AAGTGCGCCT GCClfccCCTG CTCTGAGAAT TTGAAGAGAG 240 

TAGTTCAGTT AGAATTAAGA GGCAGTAGAG AAAA&TCTTG GGAAATCTGG TTAGAGATAT 300 

AAATATGAGA ACTGGACATG GTGGTACACA CCTffiGATCT CTGTGTTTAG GAGGGAGAGG 360 



245 



250 



255 



435 
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CAGAGAGATC 


AGGAGTTCAA 


GGCCAGCCTG 


AGCTACTTGA 


GACCCAGTCT 


AAATAAATAA 


420 


GAGATAGATT 


ACAGAGTGCC 


TTTAACrAGT 


ACAGAGAAAG 


AATTTGGGTT 


TATCTGTGTC 


480 


AGTTACGCTG 


AAATAATTTT 


TAAGTAATAA 


AATCCCTTTT 


AATAAGAAAC 


CTTATGAGGT 


540 


CAGTATGCAC 


AATGAACTTA 


AGAGAGACCC 


CCAGCTCCTG 


AGCTGAGTGA 


TGGGGAAGGA 


600 


CAGCCACTGC 


CTGTGATGTG 


TGAGTGACGT 


GCTTCCAAGT 


GTTTTAACCA 


CTGACGATTA 


660 


CATAGCCTGC 


ACAGTCAGGA 


GAAAACAaCC 


GTATTCTCTG 


CCAGTTCTCT 


TCCCTTTTAC 


720 


AAACAGATGA 


GAGACACACA 


CAGAGAATCC 


ATTTAAAGAG 


CGGACCTTTG 


TTCTGATTAG 


780 


GGGCAATTTT 


AAGTACTTAA 


GAGTTCACAC 


AAAGTCTAGC 


CTTCAAAAAG 


AAAACAGGTT 


840 


CCCAAACTAG 


GGAGGAAACA 


GAATCATT'aC 


CATTTTGGTG 


ACATTTAGTG 


GGAAGAAGCT 


900 


CACAGACATT 


TAGACGTTCC 


AACTCTTTCC 


CCACTAGTGG 


ACCAAGTATA 


TAATATGGTA 


960 


TCTTTTGGGC 


ACTGGTATTA 


CAACTGTTTT 


TTAAACAAAA 


GACTTTCCTT 


GTGCTTTACT 


1020 


AAAAACCCAG 


ACGGTGAATC 


TTGAATACAA 


TGCGTGGCAC 


CCACGGCAGG 


CATTCTATTG 


1080 


TGCATAGTTT 


TGACTGACAG 


GAGATGACAGj 


CATTTGGCTG 


GCTGCGCTTG 


CTGAGGACCC 


1140 


TCTCCTCCTG 


TGTGGCGTCT 


GAGACTGTGA 


i TGCAAATGCG 


CCCGCCCTTT 


TCTGGGAACT 


1200 


CAGAACGCCT 


GAGTCAGGCG 


GCGGTGGCTA 


VTTAAAGCGCC 


TGGTCAGGCT 


GGGCTGCCGC 


1260 


ACTGCAAGGA 


TGAGGCTTCC 


TGGTTGGTTG 


TGGCTGAGTT 


CTGCCGTCCT 


CGCTGCCTGC 


1320 


CGAGCGGTGG 


AGGAGCACAA 


CCTGACTGAG 


QGGCTGGAGG 


ATGCCAGCGC 


CCAGGCTGCC 


1380 


TGCCCCGCGA 


GGCTGGAGGG 


CAGCGGGAGG 


TCCGAGGGGA 


GCCAGTGCCC 


CTTCCAGCTC 


1440 


ACCCTGCCCA 


CGCTGACCAT 


CCAGCTCCCG 


CQGCAGCTTG 


GCAGCATGGA 


GGAGGTGCTC 


1500 


AAAGAAGTGC 


GGACCCTCAA 


GGAAGCAGTG 


GACAGTCTGA 


AGAAATCCTG 


CCAGGACTGT 


1560 


AAGTTGCAGG 


CTGACGACCA 


TCGAGATCCC 


ggAgggaatg 


GAGGGAATGG 


AGCAGAGACA 


1620 


GCCGAGGACA 


GTAGAGTCCA 


GGAACTGGAG 


AGTlCAGGTGA 


ACAAGCTGTC 


CTCAGAGCTG 


1680 


AAGAATGCAA 


AGGACCAGAT 


CCAGGGGCTG 


CAGGGGCGCC 


TGGAGACGCT 


CCATCTGGTA 


1740 


AATATGAACA 


ACATTGAGAA 


CTACGTGGAC 


AACAAAGTGG 


CAAATCTAAC 


CGTTGTGGTC 


1800 


AACAGTTTGG 


ATGGCAAGTG 


TTCCAAGTGT 


CCCAGCCAAG 


AACACATGCA 


GTCACAGCCG 


1860 


GGTAGGTGTA 


ATGAGGGTCA 


TACAGTTTGT 


tcatAaaagc 


TGTATAGCCA 


GATAGTGGCC 


1920 


ATAAACATTA 


ACCCGAGGGA 


GCATAAGTTA 


gtcagWcttt 


CACCTGTTAA 


GTTATGGCAG 


1980 


GAGAAACAAG 


TGTTTTCTCA 


AATGAGACAA 


C A A A A TftinT 


AAA TP 7\ TIT 1 J\ 


Lbl ALAAAAA 


2040 


TCCTATTAGT 


TGTACTCGTT 


AGAGACCGTC 


ACTTGckAGT 


CTCTAGACCT 


TCCCTGCTAG 


2100 


GTCGACCAAC 


AGACGAGCAG 


AAACAGATTC 


CTCCCGOAAT 


CTGAACACAT 


ATTTGAACAC 


2160 


AGGACAGGTA 


TGGCAAGGTT 


CCTGGCTCTG 


CTTGCTTaGG 


TCCCTGGGAA 


TCAGATCTTG 


2220 


GGTGGCTGAT 


GGGCTTTATA 


AGGCTTTCAC 


AAACAATOCTG 


CTGTGCTAGG 


TTCTCAAATA 


2280 


TCTAGTGAGA 


ATGGGAGATT 


TTTATACATG 


GAAGCATCRC 


TCCTCTCTCT 


CTCCTCTCTC 


2340 
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CTCTCTCTTC 


TCTCTCTCTC 


TCTCTOTCTC 


TCTCTCTCTC 


TCTCTCTCTC 


TCTCTCTCTC 


2400 


CTCCCTCCCT 


CCCTCTCTCT 


CTCTTTGjTGT 


GCGTGTGTGG 


TGGGGATGAG 


GACACGTGTA 


2460 


GAACTTCGGG 


GGTTGAGACT 


TAGTGCATAT 


GCATCCTCAC 


CATTCCAGTT 


AGTGAATGTT 


2520 


AACACTATTT 


AAGGTCACAG 


ACCTAAcAjC 


CTTCTGTGTC 


CGGATTCCTG 


GATTCCTAGG 


2580 


ACCTTTGTGG 


ATGGGTTGCC 


ACACCCTCTG 


TGTTCATCCT 


GACTGTGAGG 


TCGATGGGAC 


2640 


ATAGTAGGGA 


TAACTTTCAT 


TTGGAATCTC 


TAGAGATGGT 


AGGTCATCAT 


GTCATAGAAT 


2700 


GTTATCACTA 


ATGACCAAGA 


TAGAC AC TC A 


TGTTTAAGAG 


ACATCACAAG 


GTGTATATTA 


2760 


AATATGACAT 


GGCATATAAC 


TTGTAATGACl 


ACAAAAATAT 


TCTGTTACCT 


ACTTTTCTCC 


2820 


TAAAAGCTTG 


GGACTCTCCA 


GAGTTCTAAA 


Itacatgcaaa 


CAGATTATTG 


TGTTTTACAG 


2880 


GAATCTTATA 


TTGAACTTTC 


TTTACCTGAC 


Itcaaatttta 


TTAAAATTAA 


CTGGGAACAA 


2940 


ATAGTTGGTC 


TCTAATCTCT 


ACAAAAACCA 


Iccaaatgatt 


ACACTGAGCA 


TAATTATAAT 


3000 


CACCCTGCTG 


CTACGTCTAG 


AAACCAAACT 


gtgaaatatt 


GGCTGACTGT 


ATACCTTCCT 


3060 


AAATAATAAA 


TTCAGGATAA 


CATTGCCATA 


ITTATTGGAGA 


ACCCCCCCCT 


CCCTTTTAAA 


3120 


ACTGGAATCA 


TTTTATGTCA 


ATCTCAGGTG 


AAATACGAAT 


GGGTTTCAGA 


ACAGTGCTGT 


3180 


GCACTGAAGG 


CTGACATTTA 


GAACATATAT 


MACGATTTCT 


GTAAAGTCTG 


CTGTAACAAT 


3240 


TGCTGATTGT 


ATCCTAGGAG 


ACTTGGACTC 


CffCTCAACGT 


TAAGGCAGAG 


GAATATAATG 


3300 


GTTATGAGAG 


TAAAACTCTC 


TGTCAGGTAC 


ATCTGGCTTT 


CTGTCCCAGC 


TCTGTCACTT 


3360 


AACACTTAGT 


TGCGGTGGGA 


AAACTCCCTG 


ATCTTCCGGG 


AGAC TAAGTA 


ACTGTATAAG 


3420 


CAAGCTGGCC 


GTGATATCCA 


CGTCGTAAGG 


CTCCTGTGTG 


GGTTCAGTGA 


AAACTGTTAC 


3480 


AGTGATTGGC 


AGAGTTTCTG 


GAGGTCATTG 


ACqCTCATTA 


AACCTTGCAT 


ACACTTATTC 


3540 


TTACTACTCT 


TTGCTGTTAG 


TGTTGCCACC 


aggWttgcca 


TTCAAGGCAG 


TCCTGTATAC 


3600 


TTGATAACAC 


CAGTTGGTTC 


TGAGGCCTTA 


GTTAGCATCT 


GTTAGCCTGG 


TTCAGGAGAG 


3660 


TGTATCAGAG 


CCAGGTTCCT 


CTATCACATA 


AACTGTAACG 


CAAGTGAATT 


GTCCAATTGC 


3720 


TGTTGAGTCT 


GAGAGTCCTT 


GAGGTGCATA 


GCTTTGACTA 


ATAAATCCCC 


ATGCTTTTAT 


3780 


GCTTTTCCTT 


CCTCCCTCTT 


CCAGTTCAAC 


atctaWtata 


CAAAGATTGT 


TCCGACCACT 


3840 


ACGTGCTAGG 


AAGGAGAAGC 


AGTGGGGCCT 


ACAGAaTTAC 


CCCTGATCAC 


AGAAACAGCA 


3900 


GCTTTGAGGT 


CTACTGTGAC 


ATGGAGACCA 


TGGGTGGAGG 


CTGGACGGTG 


CTGCAGGCTC 


•a q c n 

J70U 




A VjV- ^ aav* 




AGTGGAAftGA 


CTACAAAGCC 


GGCTTTGGAA 


4020 


ACCTTGAACG 


AGAATTTTGG 


TTGGGCAACG 


ATAAAATTCA 


TCTTCTGACC 


AAGAGTAAGG 


4080 


AAATGATTTT 


GAGAATAGAT 


CTTGAAGACT 


TTAATGGTCT 


CACACTTTAT 


GCCTTGTATG 


4140 


ATCAGTTTTA 


TGTGGC TAAT 


GAATTTCTCA 


AATACCGATT 


ACACATCGGT 


AACTACAATG 


4200 


GCACGGCAGG 


GGATGCCTTG 


CGTTTCAGTC 


GACACTACAR 


CCATGACCTG 


AGGTTTTTCA 


4260 


CAACCCCAGA 


CAGAGACAAC 


GATCGGTACC 


CCTCTGGGAA 


CTGTGGGCTC 


TATTACAGCT 


4320 
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CAGGCTGGTG GTTTGATTCA TGTcVcTCTG CCAATTTAAA 
AATACAAAGG TGTCCGTAAT GGGATtTTTCT GGGGCACCTG 
AGCCAGGTGG CTACAAGTCC TCCTTIAAAC AGGCCAAGAT 
TCAAGCCATA AATTGCTAGT GTTCATCTCT CTGGGCACTC 
ATTCCTTCAG CCCTTTACCA TATGTCTCAG TTTATATTCC 
CCTTTAAAGC TTTACAGCTT TTAGAATjAAA GCTGAAAAGA 
GCTGTTATAT GAGGAATGCT TGAAAGckcT GGAAATATTG 
CAAAACCTTT CATTTTTATT AGTTGAAAAG TTTCCTAATA 
AAAACTAAAT TATTCAGCAA GCTAGATTCT ATATACGCAA 
AAATATACAC ATTTGAGAAT ATACCAGTQC TTCCAGGTAC 
AGTATTATCT TTCGTCTAAG AAGAACTTAA AGCATTTTAG 
GATGGGATTG GGGGCCAGGG ACAATATGTfl TAGCTAAATG 
TGGCATTAAA ATACCTAAAA ATGTGGTAGCl ATAATATATG 
AAAATAATGT TACCCTGTAG ACTTTGGTTT lAGTGGTAATT 
TTAGACCGCG ATACAAAAGC TGCTTTATCC TCTCCCTCTG 
GTGATGTAAG GTGCTAGACT ACTGTAAGGT TTCCTTGGGG 
CACACTGGTT TATATTTTGA AAGCCAATCC TAATCCCAAA 
TCAACGTTCT AGGAAGCTGA CTTTTCTAGA AqAAATGTAT 
ATT 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 amino acic 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



TGGCAAATAT 
GCCTGGTATA 
GATGATTAGG 
ACTATCTAAG 
TTTCCTATGG 
TCTAAAAAGA 
ACAATTATAC 
TTTTTATTAT 
GTTTTATTTT 
AACTGAAAGC 
TTCTCAAGAA 
TATTCATCTA 
TCTCTTCCCT 
CACTTACTGT 
CTCTCTGTGC 
AAAGGCATGG 
GCAATACTGT 
TTATTAGGAT 



TACCACCAGA 
AACCAGGCAC 
CCCAAGAATT 
AGGACGATGA 
CTAAACATTT 
CTCCTATGTT 
ATTATAATTG 
TTTTATAATA 
CACTAGGGCT 
CAAGAACTGT 
GAAGGGCAGG 
ATGCAAAATA 
CTCCAATTGA 
TTATAGCCTG 
ACAATGGTTT 
TAAGGGAAAA 
TGTCGAGGAG 
GAATTTGGGA 



4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5403 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:4: 

Met Arg Leu Pro Gly Trp Leu Trp L4u Ser Ser Ala Val Leu Ala Ala 
1 5 \ 10 15 

Cys Arg Ala Val Glu Glu His Asn Leli Thr Glu Gly Leu Glu Asp Ala 
20 25\ 30 

Ser Ala Gin Ala Ala Cys Pro Ala Arg\ Leu Glu Gly Ser Gly Arg Cys 
35 40 \ 45 

Glu Gly Ser Gin Cys Pro Phe Gin Leu Vrhr Leu Pro Thr Leu Thr He 
50 55 \ 60 

Gin Leu Pro Arg Gin Leu Gly Ser Met <\lu Glu Val Leu Lys Glu Val 
65 70 \ 75 80 
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Arg Thr Leu Lys Glu Al\a Val Asp Ser Leu Lys Lys Ser Cys Gin Asp 
85 \ 90 95 

Cys Lys Leu Gin Ala Asp\Asp His Arg Asp Pro Gly Gly Asn Gly Gly 
100 \ 105 110 

Asn Gly Ala Glu Thr Ala blu Asp Ser Arg Val Gin Glu Leu Glu Ser 
115 1 120 125 

Gin Val Asn Lys Leu Ser Aer Glu Leu Lys Asn Ala Lys Asp Gin lie 
130 lb5 140 

Gin Gly Leu Gin Gly Arg Lfeu Glu Thr Leu His Leu Val Asn Met Asn 
145 150 \ 155 160 

Asn lie Glu Asn Tyr Val Asb Asn Lys Val Ala Asn Leu Thr Val Val 
165 1 170 175 

Val Asn Ser Leu Asp Gly Lys\ Cys Ser Lys Cys Pro Ser Gin Glu His 
180 \ 185 190 

Met Gin Ser Gin Pro Val Gln\His Leu lie Tyr Lys Asp Cys Ser Asp 
195 \200 205 

His Tyr Val Leu Gly Arg Arg feer Ser Gly Ala Tyr Arg Val Thr Pro 
210 215 \ 220 

Asp His Arg Asn Ser Ser Phe G\Lu Val Tyr Cys Asp Met Glu Thr Met 
225 230 1 235 240 

Gly Gly Gly Trp Thr Val Leu Gin Ala Arg Leu Asp Gly Ser Thr Asn 
245 1 250 255 

Phe Thr Arg Glu Trp Lys Asp Tyt Lys Ala Gly Phe Gly Asn Leu Glu 
260 I 265 270 

Arg Glu Phe Trp Leu Gly Asn Asd Lys lie His Leu Leu Thr Lys Ser 
275 2801 285 

Lys Glu Met He Leu Arg He Aspl Leu Glu Asp Phe Asn Gly Leu Thr 
290 295 \ 300 

Leu Tyr Ala Leu Tyr Asp Gin Phe Vryr Val Ala Asn Glu Phe Leu Lys 
305 310 \ 315 320 

Tyr Arg Leu His He Gly Asn Tyr Asn Gly Thr Ala Gly Asp Ala Leu 
325 \ 330 335 

Arg Phe Ser Arg His Tyr Asn His Asfc Leu Arg Phe Phe Thr Thr Pro 
340 345 350 

Asp Arg Asp Asn Asp Arg Tyr Pro Ser\ Gly Asn Cys Gly Leu Tyr Tyr 
355 360 \ 365 

Ser Ser Gly Trp Trp Phe Asp Ser Cys \Leu Ser Ala Asn Leu Asn Gly 
370 375 \ 380 

Lys Tyr Tyr His Gin Lys Tyr Lys Gly \\al Arg Asn Gly He Phe Trp 
385 390 \ 395 400 

Gly Thr Trp Pro Gly He Asn Gin Ala Gin Pro Gly Gly Tyr Lys Ser 
405 41& 415 

Ser Phe Lys Gin Ala Lys Met Met He ArA Pro Lys Asn Phe Lys Pro 
420 425 \ 430 



